Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
1.
Int J Data Min Bioinform ; 2(2): 176-92, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18767354

RESUMO

This study presents three feature selection methods for identifying the metabolite features in nuclear magnetic resonance spectra that contribute to the distinction of samples among varying nutritional conditions. Principal component analysis, Fisher discriminant analysis, and Partial Least Square Discriminant Analysis (PLS-DA) were used to calculate the importance of individual metabolite feature in spectra. Moreover, an Orthogonal Signal Correction (OSC) filter was used to eliminate unnecessary variations in spectra. We evaluated the presented methods by comparing the ability of classification based on the features selected by each method. The result showed that the best classification was achieved from an OSC-PLS-DA model.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Espectroscopia de Ressonância Magnética/métodos , Modelos Químicos , Reconhecimento Automatizado de Padrão/métodos , Proteoma/química , Proteoma/metabolismo , Simulação por Computador , Análise Discriminante , Análise de Componente Principal , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
2.
Bioresour Technol ; 99(17): 8445-52, 2008 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18407492

RESUMO

Rapid methods for the characterization of biomass for energy purpose utilization are fundamental. In this work, near infrared spectroscopy is used to measure ash and char content of various types of biomass. Very strong models were developed, independently of the type of biomass, to predict ash and char content by near infrared spectroscopy and multivariate analysis. Several statistical approaches such as principal component analysis (PCA), orthogonal signal correction (OSC) treated PCA and partial least squares (PLS), Kernel PCA and PLS were tested in order to find the best method to deal with near infrared data to classify and predict these biomass characteristics. The model with the highest coefficient of correlation and the lowest RMSEP was obtained with OSC-treated Kernel PLS method.


Assuntos
Biomassa , Modelos Estatísticos , Espectroscopia de Luz Próxima ao Infravermelho , Calibragem , Análise dos Mínimos Quadrados , Análise de Componente Principal , Análise de Regressão , Madeira/química
3.
Expert Syst Appl ; 35(3): 967-975, 2008 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-21472035

RESUMO

High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.

4.
IEEE Trans Syst Man Cybern B Cybern ; 36(5): 1128-38, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-17036818

RESUMO

Due to the development of sensing and computer technology, measurements of many process variables are available in current manufacturing processes. It is very challenging, however, to process a large amount of information in a limited time in order to make decisions about the health of the processes and products. This paper develops a "preprocessing" procedure for multiple sets of complicated functional data in order to reduce the data size for supporting timely decision analyses. The data type studied has been used for fault detection, root-cause analysis, and quality improvement in such engineering applications as automobile and semiconductor manufacturing and nanomachining processes. The proposed vertical-energy-thresholding (VET) procedure balances the reconstruction error against data-reduction efficiency so that it is effective in capturing key patterns in the multiple data signals. The selected wavelet coefficients are treated as the "reduced-size" data in subsequent analyses for decision making. This enhances the ability of the existing statistical and machine-learning procedures to handle high-dimensional functional data. A few real-life examples demonstrate the effectiveness of our proposed procedure compared to several ad hoc techniques extended from single-curve-based data modeling and denoising procedures.


Assuntos
Algoritmos , Inteligência Artificial , Análise por Conglomerados , Compressão de Dados/métodos , Armazenamento e Recuperação da Informação/métodos , Reconhecimento Automatizado de Padrão/métodos , Processamento de Sinais Assistido por Computador , Bases de Dados Factuais , Limiar Diferencial , Transferência de Energia , Transdutores
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...